CDS

Accession Number TCMCG037C10966
gbkey CDS
Protein Id XP_022139604.1
Location complement(join(1302171..1302230,1303612..1303778,1305150..1305255,1305433..1305470,1305720..1305807,1306453..1306506,1306591..1306699,1306816..1306898,1307003..1307065,1307176..1307238,1308034..1308132,1308346..1308384,1308480..1308535,1308797..1308911,1309025..1309194,1309604..1309707,1309849..1309943,1310258..1310340,1310417..1310609))
Gene LOC111010464
GeneID 111010464
Organism Momordica charantia

Protein

Length 594aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA397875
db_source XM_022283912.1
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic [Momordica charantia]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026        [VIEW IN KEGG]
KEGG_Reaction R04558        [VIEW IN KEGG]
KEGG_rclass RC00010        [VIEW IN KEGG]
RC01190        [VIEW IN KEGG]
RC01943        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01663        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00340        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00340        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGAAGCGCCGCCGTTCGCCGTCGCTGGTTCTTCTTCCTCTTCTTCTTCTCAGATTGTATTTCGATCACTTTCCTCATCGCCTCGTACGAGCTCTCTCTTTTTTCTTCCCAATAGTTGTTATAAAACTCGTCATCTCAAAGTCAAGTCCTCTGGCAAGTTTGCTGTTCGCGCCTCATTTGCTGGTGACTCAGTCGTGACTTTGCTGGATTACGGTGCTGGTAATGTTCGTAGTGTGAGGAATGCAATTCGTTACCTTGGCTTTGATATCAAAGATGTGCAAACACCAGAAGACATTCTAAATGCAAACCGCCTAATATTTCCTGGAGTTGGGGCATTTGCTCCGGCCATGGACGTGCTAAACAATAAAGGCATGGCTGAAGCACTCTGCACTTATATTGAGAATGATCGCCCATTTTTAGGAATTTGTCTTGGGCTTCAACTACTCTTTGAATCAAGCGAGGAGAATGGACCAGTAAAAGGTCTTGGCTTAATACCGGGTGTGGTTGGGCGTTTTGACTCTTCCAATGGTTTTAGGGTACCCCATATTGGGTGGAATGCTCTGGAAATCTCTAAGGACTCTGAGATCTTGGATGATATTTCTAATCGACATGTCTACTTTGTTCACTCTTACCGTGCTATGCCATCAGATGAGAACAAGGAGTGGATCTCTTCTACTTGCAGCTATGGCGACAGGTTTATAGCTTCAGTTAGAAGGGGAAATGTCCATGCAGTTCAATTCCACCCAGAAAAGAGTGGAGATGTAGGTCTGTCTGTCCTAAGAAGATTCTTGTTTCCAAAGTCGACTGTCACCAAGAAGCCCAGTGAGGGAAAGGCTTCAAGGCTTGCAAAAAGGGTAATTGCTTGTCTTGACGTGCGAACAAATGACCAAGGGGACCTTGTTGTTACCAAAGGGGACCAATATGACGTAAGGGAGCAAACAGAAGAGAATGAGGTTAGGAACCTTGGCAAGCCGGTAGATCTTGCTGGACAGTACTACAAAGATGGAGCTGATGAGGTCAGTTTTTTGAATATAACTGGTTTCCGTGACTTCCCTCTGGGCGACTTGCCAATGTTGCAGGTGCTGAGATACACATCAGAAAATGTTTTTGTACCATTGACTGTTGGGGGTGGAATTAGAGATTTTAAGGATGCGAATGGCAGACACTATTCTAGCTTGGAAGTTGCTTCAGAATATTTCAGATCTGGAGCTGATAAAATATCTATTGGAAGCGATGCAGTTTATGCTGCTGAGGAATATTTAAGAACTGGCGTAAAGACGGGAAAGAGCAGCTTGGAGCAGATTTCTAAGGTTTATGGAAATCAGGCTGTTGTGGTTAGTATTGATCCTCGTAGAGTGTACCTTAAAAGTCCTGATGATGTGGAGTTCAAAGTTATACGAGTAACAAACCCAGGTCCTAATGGAGAAGAATATGCATGGTATCAGTGTACAGTTAACGGAGGTCGAGAAGGTCGACCAATTGGAGCTTATGAGCTTGCAAAAGCAGTAGAGGAGCTAGGAGCTGGAGAAATACTGTTAAACTGCATAGATTGTGACGGTCAAGGAAAAGGATTCGATATAGATCTAGTAAAGCTGATATCAGATTCTGTTAGCATACCCGTTATTGCCAGTAGTGGAGCTGGGTCTTCTGACCATTTCTCTGATGTGTTTAACAAAACGAATGCTTCTGCTGCTCTTGCTGCTGGAATTTTTCATCGTAAGGAGGTGCCGATTCAGTCCGTAAAAGAGCATTTATTAAAGGAAGGCATAGAAGTGAGAATCTAA
Protein:  
MEAPPFAVAGSSSSSSSQIVFRSLSSSPRTSSLFFLPNSCYKTRHLKVKSSGKFAVRASFAGDSVVTLLDYGAGNVRSVRNAIRYLGFDIKDVQTPEDILNANRLIFPGVGAFAPAMDVLNNKGMAEALCTYIENDRPFLGICLGLQLLFESSEENGPVKGLGLIPGVVGRFDSSNGFRVPHIGWNALEISKDSEILDDISNRHVYFVHSYRAMPSDENKEWISSTCSYGDRFIASVRRGNVHAVQFHPEKSGDVGLSVLRRFLFPKSTVTKKPSEGKASRLAKRVIACLDVRTNDQGDLVVTKGDQYDVREQTEENEVRNLGKPVDLAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRYTSENVFVPLTVGGGIRDFKDANGRHYSSLEVASEYFRSGADKISIGSDAVYAAEEYLRTGVKTGKSSLEQISKVYGNQAVVVSIDPRRVYLKSPDDVEFKVIRVTNPGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGKGFDIDLVKLISDSVSIPVIASSGAGSSDHFSDVFNKTNASAALAAGIFHRKEVPIQSVKEHLLKEGIEVRI